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1. Motivations 



In recent years, wavelet thresholding procedures have been widely applied to 
the field of nonparametric function estimation. They excel in the areas of spa- 
tial adaptivity, computational efficiency and asymptotic optimality. Among the 
various thresholding techniques studied in the literature , there are the term-by- 
term t hresholding (hard, soft, . . . ) initially developed bv lDonoho and Johnston^ 
(Il995h and the block thre shol ding (globa l , Blo ckShrink, . . . ) introduced by 



Kerkvacharian et al.l (|l996l) and lHall et all (|l999l ). 



Several recent works demonstrated that the block thresholding methods can 
enjoy better theoretical properties than the conventional term-by-term thresh- 
olding methods. This superiority has been proved for various st atist i cal m odels 
via the minimax approach un der the risk. See, for instance, Cai ( 199S) and 
Cavalier and Tsvbakov ( 2001 ) for the Ga ussian sequence model, Cai and ChickenI 



br the density estim ation. I Chicke n (2003) for the regression model with 
nonequispaced samples and ChickenT i 20071 ) for the regression model with ran- 
dom uniform design. 

This paper presents an extension of a result established by IChickenI (|2007l ). 
We prove that a generalized version of the BlockShrink construction achieves 
better rates of convergence than the conventional tcrm-by-term thresholding 
estimators. 
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The main contributions of this study concern the two foUowing points : 

- The model: we consider the regression model with (known) random design, 
not necessarily uniform. 

- The statistical approach: we adopt the minimax approach under the 
risk over Besov balls (regular, sparse and critical zones). The parameter p 
can be greater than or equal to 2. 

From a technical point of view, the proof is significantly more complicated than 
for the unif orm design and t he case p = 2. We combine a general theorem es- 
tablished bv lChesneau ( 20061 Theorem 4.2) with several probability inequalities 



such that the Talagrand inequality and the Borel inequality. 

The paper is organized as follows. Section[2]introduces the model, the adopted 
minimax approach, the wavelet bases and the considered estimator. Section [3] 
presents the main result while Section 0] contains a detailed proof of the main 
result. 



2. Model, Wavelet bases and Estimator 
2.1. The model 

We observe n pairs of random variables { {Xi , Fi ) , . . . , (X„ .Yn)} governed by 
the equation: 

Y, = f{X,) + z,, z = l,...,n, (2.1) 

where the design variables (Xi, . . . , Xn) are i.i.d. with Xi £ [0, 1], the variables 
(zi, . . . , Zn) are i.i.d. Gaussian with mean zero, variance one and are independent 
of (Xi, . . . , Xn)- We denote by g the density of Xi. The function / is unknown. 
The goal is to estimate / from the observations {{Xi, Fi), . . . , (X„, Yn)}- Addi- 
tional assumptions on the functions / and g will be specified latter (see Theorem 
O below). 

To estimate /, several adaptive methods have been elaborated (according 
to the nature o f the d esign) . See, for instance, the tr ansforma t ion m ethod of 



Cai and Brown 



998). the mo del selection method of BaraudI ( 20021 ) and the 



Gaiffad (l2006l) 



kernel method of 

In this study, we shall consider a particular wavelet thresholding estimator. 
For the sake of clarity, let us denote this estimator by /„. The performances of 
/„ will be measured under the global L^* risk defined by 

i?(/„, /) = mWfn - /lip = «} (yf^ \fn{t) ~ f{t)\''dt^ . 

Here, p is a real number greater than or equal to 2 and is the expectation 
with respect to the distribution of the observations {(Xi, Yi ),..., (X„, Yrt)}. 
The unknown regression function / is supposed to belong to a wide class of 
functions: the Besov balls. Wavelets and Besov balls are presented in the next 
subsection. 
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2.2. Wavelets and Besov balls 

We consider an orthonormal wavelet basis generated by dilation and transla- 
tion of a compactly supported "father" wavelet cf) and a compactly supported 
"mother" wavelet ip- For the purpose of this paper, we use the periodized wavelet 
bases on the unit interval. Let us set 

<l>j.k{x) = 2^/20(2% - k), ijj,k{x) = 2^'/2^(2Jx - k). 



And let us denote the periodized wavelets by (f>^'^^{x) — ~ 0) 

i^j^kix) = '^i^-z'4'j,k{x — I), X ^ [0,1]- Then, there exists an integer r such 
that the collection C, = {4>^rki^)^ k — . . . ,2'^ — 1; j = r, . . . , oo, k = 

0,...,2-' — 1} constitutes an orthonormal basis of L^([0, 1]). The superscript 
^^per" will be suppressed from the notations for convenience. 

For any integer I > t, a square-integrable function on [0, 1] can be expanded 
into a wavelet series 



2-1 

/(^) = X! oii,k(j}i,k{x) 

k=0 



oo 2^-1 

EE 

j=l k=0 



Pj.klJ-^j.k{x), 



where a^^k = f{x)4>j,k{x)dx and 0i h = Q f{x)^.i k{x)dx. 

For further details about wavelets, see iMeveij (|l99d ) and lCohen eTaP (|l993h . 

Since ip is compactly supported, the following property of concentration holds: 
for any m > 0, any j > t and any x £ [0, 1], there exists a constant C > 
satisfying 



J2 \^Mr <C2^""^^. 



(2.2) 



k=0 



Now, let us define the main sets of function considered in our statistical 
approach. Let M e (0, oo), s £ (0,oo), tt G [1,oo] and r € [l,oo]. Let us set 
Pt-im = ctr,k- We say that a function / belongs to the Besov balls ^.{M) 
if and only if there exists a constant AI* > such that the associated wavelet 
coefficients satisfy 



2^-1 



J2 [2J-(^+i/2-iA)[^ i^^.^i^jiAjr 

j = T-l 



k=0 



1/r 



< M* 



with the usual modification if g = cx). We work with the Besov balls because 
of their exceptional expressive power. For a particular choice of param eters s, 
TT and r, they contain the Holder and Sobolev balls (see, for instance, iMever 
(|l99(t ). 
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2.3. The estimator 



We are now in the position to describe the main esti mat o r of th e study. It is a 
version of the BlockShrink estimator developed by ICail for the Gaussian 

sequence modeL 

Let p £ [2,00), d e (0,00) and L be the integer L — [(logn)P/^J where the 
square brackets denote the floor function. Let ji and j2 be the integers defined 

by 

ji = L(p/2)log2(logn)J, 32 = L(l/2)log2 (n/logn)J. 

For any j £ {ji, . . . , ^2}, let us set Aj — {l^. . . , 2^L^^} and, for any K <E Aj, 

Bj^K = {A: e {0, . . . , 2^' - 1} : {K -l)L <k < KL- 1}. 

We define the (L^ version of the) BlockShrink estimator by 

2^1-1 32 

.fn{x)= ^ aj^,k(t)n,k{x)+Yl H H /3i,fel{fc^,^>d„-i/2}V'j,fe(2;), (2.3) 

fc=0 j=ji KeAj keBj,K 



where 6, k 



1/p 



and 



a,- fe = n 



'j,k 



-i^y,5(^0"V,,fe(^.)-(2.4) 



This estimator was first defined in this LP form bv lPicard and TriboulevI (|2000l ) 
for general statistical models. 



Comments: 

- The sets Aj and Bj^x are chosen such that DK^Aj^^j^K = {0, . . . , 2-' — 1}, 
Bj KnBjK' =0 for any K ^ K' with K, K' £ A^, and \Bj k\ = L = 
L(logn)P/2j. 

- It is easy to show that Uj^k and [3j^k are unbiased estimators of Uj^k and 
/3j_fe, the wavelet coefficients of /. Moreover, they satisfy several probability 
inequalities which will be at the heart of the proof of the main result. 
Further details concerning these inequalities are given in Section |4l 

- The considered BlockShrink estimator is adaptive since it does not depend 
on the smoothness of the unknown function /. However, it depends on 
the norm parameter p. An open question is : can we construct a block 
thresholding procedure that is adaptive to the risk for all p ? 



3. Main result 

Theorem 13.11 below determines the rates of convergence achieved by the Block- 
Shrink estimator under the risk over Besov balls. 
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Theorem 3.1. Let us consider the regression model with random design i2.1\) . 
Suppose that : 

• the unknown regression function f is bounded from above, i.e. ||/||oo < M' , 
where M' > denotes a known constant. 

• the density g of Xi is known and bounded from above and below. 

Let us consider the BlockShrink estimator fn defined by \2. S\) with a large enough 
threshold constant d. Let p £ [2,oo[. Then there exists a constant C > such 
that, for any tt G [1, oo], r G [1, oo], s G (I/tt + 1/2, oo) and n large enough, we 
have 

sup E] (\\U - fWl) < Cipn, 



where 



n 



"iP(logn)"iPi{p>'}, when e > 0, 

(logn/n)"2P(logn)(P-''/'')+i{'>=«}, when e < 0, 



withai = s/(2s+l), a2 = (s-l/7r+l/p)/(2(s-l/7r) + l) ande^ 7rs+2"i(7r-p). 



The rates of convergence presented in Theorem lS . 1 1 above are minimax except 
in the cases {p > tt} n {e > 0} and e = where there is an extra logarithmic 
term. They are better than those achieved by the conventional term-by-term 
thresholding estimators (hard, soft,...). The main difference is for the case 
{tt > p} where there is no extra logarithmic term. Let us mention that Theorem 
3.11 can be proved for p G (1, 2) if we consider the Bl ockShrink estimator (|2.3p 
defined with L — hin. Further details can be found in IChesneaul(|2006l ). Further 



details about the rates of convergence for the regression problem (12. ip via the 



minimax approach under the risk over Besov balls can be found in [Chesneau 



(|200<D . 

As mentioned in t he motivations of the paper. Theorem 13.11 is an extension 
of a result proved bv lChickenI (|2007L Theorem 2) for the uniform design, the L 



risk and the Holder balls Bl^ ,^{M). 

Comments on the choice of the thresholding constant d. From a theoretical 
point of view, it is difficult to determine the exact minimum value of d such that 
/„ achieves the rates of convergence exhibited in Thcorcm l3.1l In fact. Theorem 
13. II holds for d > fii where /ii refers to the constant of Proposition 14. II below. 



4. Proof of Theorem [3TT] 



Thanks to a result proved bv I Chesneau ( 2006 1. Theorem 4.2), the proof of Theo- 
rem [33] is an immediate consequence of Proposition |4T] below. This proposition 
shows that the estimators {(3j.k)k defined by ()2.4p satisfy a standard moments 
inequality and a specific concentration inequality. 
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Proposition 4.1. Let p > 2. There exist two constants /ii > and C > 

such that, for any j £ {ji, . . . , j2}, K E Aj and n large enough, the estimators 
{(3j^k)k defined by (2^ satisfy 

- the following moments condition : 

W}(\kk-Po.k?'')<Cn-P. (4.1) 

- the following concentration inequality : 

^1\[L-^ E l/3j,fc-/3j,fcn'/^>/ii2-in-i/M <4n-P. (4.2) 



The moments inequality has been proved bv lChesnead (|2007r ). The proof of 
the concentration inequahty (|4.2p combines several concentration inequalities 
such that the Talagrand inequality and the Borcl inequality. They are recalled 
in the two auxiliary lemmas below. 



Lemma 4.1 ( Talagrand (|l994l) ). Let (Vi, . . . , K) he i.i.d. random variables and 



(ei, . . . , e„) be independent Rademacher variables, also independent of {Vi , . . . , V„ 
Let T be a class of functions uniformly bounded by T. Let rn '■ J- R be the 
operator defined by: 

n 

Suppose that 

supVar{h{Vi)) <v, E ( sup V ei/i(V;) ] < nH. 
heJ^ \heJ^~l J 

Then, there exist two absolute constants C'l > and C2 > such that, for any 
t > 0, we have: 

sup r„(/i) > t + C2H] < cxp (-nCi {t^v~^ A tT^^)) . 



Lemma 4.2 (The Borel inequality (see Adleil (jlQQOf ))). Let V be a subset of 



Let {rit)t<=:V he a centered Gaussian process. Suppose that 

E I sup j < N, s\x'pVar{rit) < Q. 

Then, for any x > 0, we have 

P ( sup ?7t > x + iV ) < exp (-a;V(2Q)) . (4.3) 
\tev J 
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We are now in the position to prove Proposition 14.11 Here, C represents a 
constant which may be different from one term to the other. We suppose that 
n is large enough. 

Proof of the Proposition \^. 1\ By the definiton of Pj^ki we have the following 
decomposition 

where 

n 

Aj,k = 7^-i^/(X,).g(X.)'V,;fc(^i) -E'; (/(Xi).g(Xi)-Vj,fe(^i)) , (4.4) 



Bj,k = n-'J29i^iy'^j,kiXi)z^- (4-5) 
By the Zp-Minkowski inequality, for any /i > 0, we have 



fcGB,- 



where 



Let us investigate separately the upper bounds of K and V. 

• T/ie upper bound for lA. Our goal is to apply the Talagrand inequality 
described in Lemma |4?T] Let us consider the set Cq defined by 

= {a = (a,,fe) e Z*; ^ laj^fel" < l} 

and the functions class J- defined by 

T = h{x) = f{x)g{x)~^ ^ a.j^ktpj,k{x), a e Cgj. 

By an argument of duality, we have 

[E I^J.fel'']^^^ = ™P E ai,fc^j,fc = sup r„(/i), 
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where Aj^k is defined by (|4.4p and r„ denotes the function defined in Lemma 
14.11 Now, let us evaluate the parameters T, H and v of the Talagrand inequality. 

First of all, notice that, for p >2 (and, a fortiori, q — 1 + {p — < 2), an 
elementary inequality of Ip norm gives 

sup[ ^ Kk\T'<^MT. Kh\'']'^'<^- 



aeC 



— The value ofT. Let /i be a function in J-. By the Cauchy-Schwarz inequality, 
the assumptions of boundedness of / and g and the property of concentration 
([221), for any x G [0, 1], we find 



\hix)\ < \fix)\\9{x)r'[ |V^,,fe(a;)p]i/2sup[ ^ \a,,k\']'^' 

< ll/llco||l/5||co[ E \i^j.k{^)\'V^'<C2^^\ 

Hence T = (72^/^. 

— The value of H . The Zp-Holder inequality and the Holder inequality imply 



E 



sup ^ aj^kezf{X,)g{X,) Vj,fe(^i 



" 1=1 keBi 



< 



< 



sup[ Y Kkl']'^' 



aec 



1/p 



1/p 



keBj,K \ i=l / 



(4.6) 



Since (ei,...,e„) are independent Rademacher variables, also independent of 
X = {Xi, . . . , Xn), the Khintchine inequality yields 

(^\Yea{X,)giX,)-'^,^,{X,)\P^ 
= E] |^E}^(|fje,/(X,)5(^.)"V,,/c(X.)nX)^ 

CE] [ I J2 \f{X.)\MX^)n^,AX^)\'\^A = CL (4.7) 



< 



Now, let us consider the i.i.d. random variables {Ni, . . . , Nn) defined by 



N, = \f{X,)\'\g{X,)\-^\yjjAXr)\^ 



i — I, . . . ,n. 
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An elementary inequality of convexity implies / < 2p/^^^(/i + I2) where 

Let us analyze the upper bounds for Ii and /2, in turn. 

— The upper hound for Ii . The Rosenthal inequality applied to (iVi , . . . , Nn) 
and the Cauchy-Schwartz inequality imply 

h < c(nE^(|iVi-E';(iVi)|f/2) + („E';(|iVi-E}'(iVi)p))''/'') 
< C (nm\Ni\P/^) + {nm\Ni\^)Y'^'^ . 

For any to > 1, j G {ji, . . . , j^} and k e {0, . . . , 2-' — 1}, the assumptions of 
boundedness of / and g give 

Jo 

Jo 

Therefore h < Cn^l'^. 

— The upper bound for l2- Since Wj[Ni) < C, we have I2 < CnP^'^. 
Combining the obtained upper bounds for Ii and /2, we find 

I <C{Ii+l2) <CnP/^. (4.8) 

Putting (1331) and (g^) together, we see that 



/p 



Hence H = Cn-^/^^i/P. 

— r/ie value of v. By the assumptions of boundedness of / and g and the 
orthonormality of ^, we obtain 

sup Var{h{Xi)) 

< supE^ (|/(Xi)n5(^i)r'l E <^i.kr^iAXi)\' 

< ||/||LI11/5||ooSupeW J2 E '^j,ka,,k'9iX,)-'^P,MXi)^Pj.k'{X,) 

\keBj,K k'eB-i,K 



aec„ 



= C sup ^ ^ Oj^kajM' / ^pj,kix)ipj,k'ix)dx 

'^^'■^1 k£Bj,K k'£Bj,K " 

= Csup Kkl^^c. 



"^'^'J kGBj,K 
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Hence v = C. 

Now, let us notice that, for any j S {ji,---,i2}, we have < n2^'^ < 
2?i3/2(logn)~^/^ Since (logn)^/^ < L^I'p < 2^/P{\ogn)^^^ , iort = 8'^ fiL^/Pn-^/^ 
we have 



{t^v-^ AtT-^) > C i^^i^{logn/n) A^i^/{logn/{n2^))j > C/i^Qog n/n). 
So, for /i large enough and t = 8^^/iL^/'°n^^/^, the Talagrand inequality yields 

< F] (^sup rn{h) > t + C2H^ 

< exp(-nCi {t^y-^ AtT-^)) < cxp {-nCfi^ {log n/n)) < n^P. 
We obtain the desired upper bound for U. 

• The upper bound for V. Here, we apply the Borel inequality described in 
Lemma [4.21 Let us consider the set Cq defined by 

Cg = {a = {a,,k)eZ*; ^ Kkl' < 
and the process Z{a) defined by 

where Bj^k is defined by (|4.5p . Let us notice that, conditionally on X = [Xi, . . . , X„), 
Z[a) is a centered Gaussian process. Moreover, by an argument of duality, we 
have 

sup Z{a) = sup = [ \B3,kT]''''- 

aeCg aeCg f.^g.^ keBj^K 

Now, let us investigate separately the upper bounds for Ej(sup£(gc and 
sup,^C,Var]{Z{a)\X). 

~ The upper bound for Wj: (sup^^izc Z{a)[K). Let us consider the set Bf^ defined 

by 
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Let US work on the set B^, the complementary of B^. By the Jensen in- 
equality, the fact that Z{a) \ X - 7V(0, n'^ ^^^^^ \g{Xi)\-'^\il)j^k{Xi)\'^) and the 
assumptions of boundedness made on 51, we find 



[ supZ(a)|x| < [ J2 ^li\Bj,km) 



i/p 



< 



C\\1/9\U E in-'Y.9i^ir'\i'J,k{Xi)\'rV' 

n 

< Cn-'l\ E ("-^E5(^i)-'lV'.-,.(^OI'-l + lf/']^/^ 



Hence N = Ar(X) = C{^l + ly/^L^^Pn-^/^ . 

— The upper bound for sup^g^, Var^{Z{a)\lC). Let us define the set Afj. by 



sup( E E "'3,kaj,k'{n ^^g{Xi) Vi,fc(-'^i)V'i,fc' (-'^»)) 



i=l 



- E K.'in>/^?- 



keBi. 



Let us work on the set the complementary of yl^. Using the assumptions 
of boundedness of g, we have 



G = sup E E 

{n-'^\g{Xi)\-%,k{Xi)i^j,k'{Xi)) 



a£C, 



" \k£Bi,K k'eBj,K 



i=l 



< c 



^^P E E { 



^keBj,K k'eBj,K 



i=l 



■ E + sup E 



<C(m+1). 
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Since Wj;{ziZii) = 1 if i = i' and Wj{ziZii) — otherwise, we have 
sup Var']{Z{a)\X) = sup E}{\Z{a)\^\X) 

aGCq a^Cq 



sup I ^ ^ aj^kaj^k'Bj^kBj^k' 



aec. 



sup [ ?i ^ ^ ^ aj^kaj,k''^'^\g{Xi)\ '^ipj^k{Xi)i{jj^k'{Xi')EJ{ziZe) 

keBj^K k'eBj^K i=l i' = l 



aec. 



= n ^ sup I ^ ^ aj^kaj,k'{n 



aec„ 



= n-^G <Cn-^{fi+l). 
Hence Q = Q(X) = Cn-i(^+ 1). 

The obtained values of N and Q will allow us to conclude. For any a; > 0, we 
have 

Vf sup Z{a) >x + C{l+ fiy/'^L^^Pn-^/'^ 

\aeCq J 

= E] I ¥]isnp Zia) > X + C{1 + ix)^^^L'^Pn-^/^\X){lB^ + Ib.)] 

\ aeCq " J 

< ¥]{B^,)+E] [F^isnp Z{a)>x + N{X)\X)] . (4.9) 
\ oec„ / 



The Borel inequality described in Lemma l4.2l implies 

E] (p]{snp Z{a) >x + N{X)\X)] <E"j{exp{~x^/{2Q{X)))). (4.10) 

\ aeCg J 

Moreover, by definition of A^,, we have 

E;; (exp (-xV(2g(X)))) = E}^ (exp i~xy{2Q{X))){lA, + l^c )) 

< P''}{A^)+exp{~nxy{2{fi+l))). (4.11) 

Putting the inequalities ((49| . ([4T0| and (|4TT]l together, for x = 8" V^^^^'^"^^^ 
and fj, large enough, we obtain 

V = ( sup Z{a) > 4:-^fiL^/Pn-^^^ ] 



, aec„ 



< ¥•} sup Z{a) > + C(l + ^y/^L^^Pr 



■ f 
< P] 



-1/2 



, aec„ 



(A^) + P]{B^) + exp (^-Cfl^L^^P/{^l + 1)) . (4.12) 
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Lemma l473l below provides the upper bounds for Pj(^^) and Pj(i3^). 
Lemma 4.3. For [i and n large enough, we have 

max(P^(^J,P;;(B^)) <n-P. 

By the inequahty (|4.12p . the fact that L = [(logn)P/^J and Lemma 1431 for 
/J, large enough, we have 

V < 3n~P. 

Combining the obtained upper bounds for U and V, we achieve the proof of 
Proposition 14. II □ 



Proof of Lemma \4^ . Let us investigate the upper bounds for P^UB^) and P'i(^^) 



• The upper bound for P'j{Bfj_) . First of all, notice that the random variables 

(|V,,fc(Xi)|2.g(Xi)-\ . . . , 1^,- fc(X„)|2g(X„)-i), 
are i.i.d. and, since g is bounded from below, we have 



1. 



So, for any j £ {ji, . . . , J2}, the Hoeffding inequality implies the existence of a 
constant C > such that 



Py(B^) < 2exp(-Cn/x22-2J) < 2 exp (-C7i^22-2j2-) < 
We obtain the desired upper bound by taking fi large enough. 

• The upper bound for P^ [A p). The goal is to apply the Talagrand inequality 



described in Lemma \AA\ Let us consider the set Cq defined by 
C, = {a = (a,- fc) e Z*; ^ |a,- ^1" < l} 

and the functions class T' defined by 

T' = Ih] h{x) = g{x)~^ ^ ^ aj^kaj^k'i'j,k{x)ipj,k' (x), a € Cq 

We have 
/ 

"■^^o \keBj,K k'£Bj_K i=l k£Bj_K 

= sup r„(/i). 
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where r„ denotes the function defined in Lemma |4. II Thus, it suffices to deter- 
mine the parameters T , H and v of the Talagrand inequahty. 

— The value of T. Let /i be a function of J-' . Using the Holder inequahty, the 
fact that g is bounded from below and the concentration property (|2.2|1 . for any 
X £ [0, 1], we find 



keBi 



Hence T = C2K 

— The value of H. The Cauchy-Schwarz inequality implies that 



E'W sup ^ ^ aj^kaj,k''^<^ig{Xi) Vj,fc(-'^i)'/'i,fe' (-'^i) 

y-'^'^i keBj,K k'GBj,K i=l 

< sup[ J2 E Kk\'Kk'\']'^'--- 



aec, 



< 



E E «}[\jZ^^g{X^)-'^,,k{X^)i^,,k'{X.)\' 
kGBj,K fe'eSj.ic V i=l 

E E K](\J2'^3(X^)-'iJjAXi}i^j.k'{X.W 

keBj,K k'i£Bj,K \ j=l 



1/2 



1/2 



.(4.13) 



Since (ei,...,e„) are independent Rademacher variables, also independent of 
X = (Xi, . . . ,X„), the Khintchine inequality and the fact that g is bounded 
from below give 

E'; (^\J2e,giX,)-' V,- , (X, (X, ) 



< CE]iJ2\9iX^)\-^\^PJAX^)\^\i^J.k'{X. 



1=1 

< C\\l/g\\lnE]{\yj,.k{X,)\^\i>j.k'{X^)\' 



(4.14) 



Using the property of concentration (|2.2p and the inequalities (|4.13p - (|4.14p . we 
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find 



" keiSj.K k'ei3j,K 



< C 



|2|2 



fees,-. 



1/2 



Hence H ^ C2^n-^^^. 

— The value of v. Using the fact that g is bounded from below, the Holder 
inequality and the property of concentration (j2.2p . we have 



snp^Var{h{Xi)) < suj> [ \g{Xi)\-^\ ^ ^ aj,fe%-,fc'^j,fe(^i)^j,fe'(^i) 

feeKj.A' k'£Bj,K- 



< C||l/.g|lLsup[ ^ l«,,fe|TE'; I E 



aGC, 



fcGB, 



feezs,- 



Hence v = C^'^K 

Now, let us notice that if i then 



-2j 



For any j e {ji, . . . , J2}, t = 2^^/i with ^ large enough, the Talagrand inequality 
gives 



< P;^(sup( J2 J2 aj,fcaj,fc'(n-iE5(^«)"Vj,fe(^.)V'j,fe'(^0)- 
E ^ 2-V + C2%-i/2) < p(sup r„(/i) > t + C2H) 



keBj,_ 



her 



< exp (-nCi {t^v-'^ A tT^^)) < exp {~nC^i^2-'^^) 

< exp(-nC^l^2-^^^) < n-P. 



This ends the proof of Lemma 14.31 □ 
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